Verb polysemy and frequency effects in thematic fit modeling
نویسندگان
چکیده
While several data sets for evaluating thematic fit of verb-role-filler triples exist, they do not control for verb polysemy. Thus, it is unclear how verb polysemy affects human ratings of thematic fit and how best to model that. We present a new dataset of human ratings on high vs. low-polysemy verbs matched for verb frequency, together with high vs. low-frequency and well-fitting vs. poorly-fitting patient rolefillers. Our analyses show that low-polysemy verbs produce stronger thematic fit judgements than verbs with higher polysemy. Rolefiller frequency, on the other hand, had little effect on ratings. We show that these results can best be modeled in a vector space using a clustering technique to create multiple prototype vectors representing different “senses” of the verb.
منابع مشابه
Measuring Thematic Fit with Distributional Feature Overlap
In this paper, we introduce a new distributional method for modeling predicateargument thematic fit judgments. We use a syntax-based DSM to build a prototypical representation of verb-specific roles: for every verb, we extract the most salient second order contexts for each of its roles (i.e. the most salient dimensions of typical role fillers), and then we compute thematic fit as a weighted ov...
متن کاملA Robust and Extensible Exemplar-Based Model of Thematic Fit
This paper presents a new, exemplar-based model of thematic fit. In contrast to previous models, it does not approximate thematic fit as argument plausibility or ‘fit with verb selectional preferences’, but directly as semantic role plausibility for a verb-argument pair, through similaritybased generalization from previously seen verb-argument pairs. This makes the model very robust for data sp...
متن کاملFitting, Not Clashing! A Distributional Semantic Model of Logical Metonymy
compute thematic fit for pairs relying only on distributional information (no information about semantic types) compare thematic fit differences across conditions and processing cost differences (high processing cost → low thematic fit, corresponding to 1-thematic fit in the model) verify if the computational model yields the same main effects and pairwise differences reported by t...
متن کاملSynergetic Properties of Chinese Verb Valency
This paper analyses the 500 most frequent verbs in contemporary Chinese and investigates their synergetic properties. The results show that the rank-frequency distributions of both valency and polysemy abide by a power-law distribution and that valency and polysemy of these verbs abide by the Good distribution and the positive negative binomial distribution respectively. Statistical analysis in...
متن کاملThematic fit evaluation: an aspect of selectional preferences
In this paper, we discuss the human thematic fit judgement correlation task in the context of real-valued vector space word representations. Thematic fit is the extent to which an argument fulfils the selectional preference of a verb given a role: for example, how well “cake” fulfils the patient role of “cut”. In recent work, systems have been evaluated on this task by finding the correlations ...
متن کامل